Feeds to Scour
Building a PDF Ingestion Pipeline with TypeScript, Wasp, and AI OCR
dev.to · 3d
Discuss: DEV
📄 Document Digitization
Show HN: Ragctl – document ingestion CLI for RAG (OCR, chunking, Qdrant)
github.com · 3d
Discuss: Hacker News
🤖 Archive Automation
Retrotechtacular: IBM’s The World of OCR
hackaday.com · 1d
📄 OCR
TRUNAJOD: A text complexity library for text analysis built on spaCy - TRUNAJOD 0.1.1 documentation
trunajod20.readthedocs.io · 9h
Parsing Grammars
The art of text (rendering) (39c3)
cdn.media.ccc.de · 9h
🖋 Typography
AI Has Made it Easy to Own Your Tools
jimmyhmiller.github.io · 1d
🤖 Archive Automation
The Complete Guide to Streaming LLM Responses in Web Applications: From SSE to Real-Time UI
dev.to · 22h
Discuss: DEV
🌊 Streaming Systems
Turning images into structured signals for modern search
visualquerypro.com · 1h
Discuss: Hacker News
🤖 AI Curation
Benchmarking and Enhancing VLM for Compressed Image Understanding
arxiv.org · 2d
🧠 Learned Compression
What I Learned Building a Storage Engine That Outperforms RocksDB
tidesdb.com · 2h
🦀 Rusty Databases
Joint 2D-3D-Semantic Data for Indoor Scene Understanding
dev.to · 3h
Discuss: DEV
Archive Topology
Building an AI Document Processing Pipeline on AWS (Textract + Bedrock)
dev.to · 11h
Discuss: DEV
🔄 Archival Workflows
Exploring TabPFN: A Foundation Model Built for Tabular Data
towardsdatascience.com · 9h
ABNF Parsing
The Transformer Architecture: A Deep Dive into How LLMs Actually Work
dev.to · 4h
Discuss: DEV
Text Parsing
Show HN: BrandRetina – screenshot similarity API for spear-phish detection
brandretina.ai · 1d
Discuss: Hacker News
🔗 Binary Similarity
SearchResearch (12/24/25): Living in an AI world that kinda, sorta works for OCR
searchresearch1.blogspot.com · 3d
👁️ OCR Evolution
faradayio/xsv2: Fork of xsv because qsv is too much
github.com · 9h
Discuss: Hacker News
🗜️ LZSS Variants
Natural language processing for word sense disambiguation and information extraction
arxiv.org · 15h
Discuss: r/compsci
📥 Feed Aggregation
A local first context engine for Cursor, Claude Code and more
repobase.dev · 1d
Discuss: Hacker News
🔄 Sync Engine
Document Parsing with LLMs: From OCR to Structural Understanding.
alamedadev.com · 3d
📋 Document Grammar